Abstract:
Performance bugs are known to be a major threat to the success of software products. Performance tests aim to detect performance bugs by executing the program through test cases and checking whether it exhibits a noticeable performance degradation. The principles of mutation testing, a well-established testing technique for the assessment of test suites through the injection of artificial faults, could be exploited to evaluate and improve the detection power of performance tests. However, the application of mutation testing to assess performance tests, henceforth called performance mutation testing (PMT), is a novel research topic with numerous open challenges. In previous papers, we identified some key challenges related to PMT. In this work, we go a step further and explore the feasibility of applying PMT at the source-code level in general-purpose languages. To do so, we revisit concepts associated with classical mutation testing and design seven novel mutation operators to model known bug-inducing patterns. As a proof of concept, we applied traditional mutation operators as well as performance mutation operators to open-source C++ programs. The results reveal the potential of the new performance-mutants to help assess and enhance performance tests when compared with traditional mutants. A review of live mutants in these programs suggests that they can induce the design of special test inputs. In addition to these promising results, our work brings a whole new set of challenges related to PMT, which will hopefully serve as a starting point for new contributions in the area.
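To illustrate the idea of a performance mutant, here is a minimal Python sketch. It is not one of the seven operators from the paper (those target C++ programs); the functions `has_duplicates`, `has_duplicates_mutant`, and `exceeds_budget` are hypothetical names, and a step counter stands in for wall-clock time so the outcome is deterministic. The mutant returns the same result as the original but does quadratic work, so only a performance test with a large enough input can kill it.

```python
def has_duplicates(items, counter):
    """Original: single pass with a set, about one step per element."""
    seen = set()
    for x in items:
        counter[0] += 1
        if x in seen:
            return True
        seen.add(x)
    return False


def has_duplicates_mutant(items, counter):
    """Hypothetical performance mutant: same result, quadratic work."""
    for i in range(len(items)):
        for j in range(i + 1, len(items)):
            counter[0] += 1
            if items[i] == items[j]:
                return True
    return False


def exceeds_budget(func, items, budget):
    """Performance test: True if func needs more than `budget` steps."""
    counter = [0]  # mutable cell so the function under test can update it
    func(items, counter)
    return counter[0] > budget


small = list(range(3))    # too small to expose the extra work
large = list(range(100))  # large enough to separate O(n) from O(n^2)

print(exceeds_budget(has_duplicates_mutant, small, budget=6))    # mutant survives
print(exceeds_budget(has_duplicates_mutant, large, budget=200))  # mutant is killed
```

With the small input the mutant stays within budget and remains live; the larger input is the kind of special test input that a review of live mutants might suggest.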
Abstract:
The performance of large-scale systems must be thoroughly tested under various
levels of workload, as load-related issues can have a disastrous impact on the system.
However, load testing often requires a large amount of time, running from hours to
even days. In our prior work, we reduced the execution time of a load test by
detecting repetitiveness in individual performance metric values, such as CPU
utilization, that are observed during the test. However, as we explain in this paper,
disregarding combinations of performance metrics may miss important information
about the load-related behavior of a system. In this paper we revisit our prior
approach, by proposing an approach that reduces the execution time of a load test
by detecting whether a test no longer exercises new combinations of the observed
performance metrics. We study three open source systems, in which we use our new
and prior approaches to reduce the execution time of a 24-hour load test. We show
that our new approach is capable of reducing the execution time of the test to less
than 8.5 hours, while preserving a coverage of at least 95% of the combinations that
are observed between the performance metrics during the 24-hour tests.
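The stopping idea can be sketched as follows. This is only an illustration under stated assumptions, not the authors' algorithm: the function `stop_index`, the fixed-width binning via `bin_width`, and the `patience` threshold are all hypothetical choices. Each sample is a tuple of metric values (for example CPU and memory utilization in percent); the sketch discretizes every sample into a combination of bins and proposes stopping once no new combination has appeared for `patience` consecutive samples.

```python
def stop_index(samples, bin_width=10, patience=3):
    """Return how many samples to run before stopping, or len(samples).

    samples:   sequence of tuples of metric values, in observation order
    bin_width: width used to discretize each metric value (assumption)
    patience:  consecutive already-seen combinations required to stop (assumption)
    """
    seen = set()
    since_new = 0
    for i, sample in enumerate(samples):
        # Map each metric value to its bin; the tuple of bins is the combination.
        combo = tuple(int(v // bin_width) for v in sample)
        if combo in seen:
            since_new += 1
        else:
            seen.add(combo)
            since_new = 0
        if since_new >= patience:
            return i + 1
    return len(samples)


# (CPU %, memory %) samples; values are made up for illustration.
samples = [(12, 55), (47, 58), (15, 51), (44, 52), (13, 59), (95, 90)]
print(stop_index(samples))               # stops before the last sample
print(stop_index(samples, patience=10))  # never stops early
```

The example also shows the trade-off: stopping early misses the final sample, which would have been a new combination, so the threshold has to be weighed against the coverage that should be preserved.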
Abstract:
Performance bugs are common, costly, and elusive. Performance tests aim to detect performance bugs by running the program with specific inputs and determining whether the observed behaviour is acceptable. No mechanisms exist, however, to assess the effectiveness of performance tests. Mutation testing is a technique to evaluate and enhance functional test suites by seeding artificial faults in the program under test. In this new idea paper, we explore the applicability of mutation testing to assess and improve performance tests. This novel approach is motivated with examples and open questions.
Abstract:
In this paper, we propose an architecture for virtualized and automated software performance testing. In general, test engineers use publicly available performance testing tools such as Load Runner and Silk Performer to validate the performance efficiency of their own systems. When they cannot use these tools because of the tools' technical limitations, most testers have to perform the testing manually. Because this situation wastes computer and human resources, we propose a test automation scheme that uses virtualization technology to prevent such waste in test environments with limited resources. This paper mainly addresses a system architecture that uses computer resources efficiently and test automation that reduces manual work. We describe our proposed method, which covers the system architecture and the test automation procedures. For the system architecture, we show how to use virtual machines and which types of virtual machines are used for performance measurement. In addition, we introduce the six steps of the automated testing procedure. Finally, a number of experiments show that the proposed schemes make automated software performance testing possible by using virtualization.
Abstract:
The Ajax model of Web applications development has rapidly gained popularity because it promises to bring the richness and responsiveness of desktop applications to the Web. Ajax implementations differ fundamentally from other Web implementations--mainly in making asynchronous requests for parts of a webpage. Techniques routinely used for performance testing traditional Web applications must be modified and enhanced to suit the needs of Ajax-based applications. Using a general example, the authors of this article examine the unique challenges of carrying out performance testing for Ajax-based applications and offer approaches and tools for overcoming them.
Abstract:
The Erlang programming language is used to build concurrent, distributed, scalable and resilient systems. Every component of these systems has to be thoroughly tested not only for correctness, but also for performance. Performance analysis tools in the Erlang ecosystem, however, do not provide the level of automation and insight needed for integration into modern tool chains. In this paper, we present PerformERL: an extendable performance testing framework that combines the repeatability of load testing tools with the detail on internal resource usage that is typical of performance monitoring tools. These features allow PerformERL to be integrated in the early stages of testing pipelines, providing users with a systematic approach to identifying performance issues. This paper introduces the PerformERL framework, focusing on its features, design and imposed monitoring overhead, measured through both theoretical estimates and trial runs on systems in production. The uniqueness of the features offered by PerformERL, together with its usability and contained overhead, shows that the framework can be a valuable resource in the development and maintenance of Erlang applications.
Abstract:
'Georgia-10T' (Reg. No. CV-113, PI 660315) is a high-yielding, large-seeded, runner-type peanut (Arachis hypogaea L. subsp. hypogaea var. hypogaea) cultivar that is resistant to Tomato spotted wilt virus (TSWV) and that was released by the Georgia Agricultural Experiment Stations in 2010. It was developed at the University of Georgia, Coastal Plain Experiment Station, Tifton, GA. Georgia-10T originated from a cross made between 'Georgia-02C' and 'Georgia-01R'. Pedigree selection was practiced within the early-segregating generations. Performance testing began in the F-4:6 generation with the advanced pure-line selection, GA 052529, which was the experimental designation of Georgia-10T. Over three years (2007-2009), averaged across 20 multilocation tests in Georgia, Georgia-10T had significantly less midseason TSWV incidence and late-season total disease incidence, and higher yield, grade, and dollar value return per hectare, compared with Georgia-01R. Georgia-10T is most similar to Georgia-01R and should be an excellent cultivar for an early-planting date option in the southeast U.S. peanut production area.
Abstract:
After the reorganization of the stallion performance test in 2016, the key elements of German horse breeding are the 14-day performance test, the stationary stallion performance test and the sport test. The aim of this study is to evaluate and analyze traits measured within the performance tests of stallions. Analyses are based on records of 1161 stallions from 2016 until May 2020. All stallions were evaluated at multiple time points during the performance test. The comparison of the first evaluation and the final score revealed changes in the ranking of the stallions with longer testing periods. The analysis of repeated measurements of four- and five-year-old stallions within the sport test confirmed the need for testing at different ages and performance levels. Most of the traits showed a moderate to high heritability. To some extent, traits and trait complexes show positive and negative genetic correlations. The trait gallop in particular points to different possibilities for consideration in breeding. The reorganized test system, with its diverse and discipline-specific forms of performance tests, makes it possible to address these factors properly.
Abstract:
It has been shown that inconsistently applied normalization for body size could be an important methodological problem in testing physical performance in areas such as sports, physical education, ergonomics, or physical medicine and rehabilitation. The aim of this study was to evaluate a part of the recently proposed classification of physical performance tests (Jaric 2003) based on the role of body size in the tested performance. Presuming a normalization method P_n = P / S^b based on an allometric relationship between the tested performance P and a selected index of body size S (P_n: performance normalized for body size; b: allometric parameter), we specifically hypothesized that (1) tests of exertion of external force (e.g., lifting weight, pushing, pulling), (2) tests of rapid movements (jumping, sprinting, kicking) and (3) tests of supporting body weight (chin-ups, squats) would reveal the values of the allometric parameter b = 0.67, b = 0 and b = -0.33 when body size is expressed as body mass, or b = 2, b = 0 and b = -1 when body size is expressed as body height, respectively. Male physical education students (n = 77) were tested on 18 standard physical performance tests belonging to the aforementioned three groups. The obtained values of the allometric parameters proved to be closely in line with the hypothesized ones. While the finding regarding the tests of exertion of external force (i.e., the tested force should be divided by m^0.67 in order to normalize the force for body size) has already been demonstrated by some authors, the findings related to the tests of rapid movements and, particularly, tests of supporting body weight are novel. Although the normalization methods discussed need further evaluation, a more accurate and consistently applied assessment of the body size-independent indices of the evaluated groups of standard tests could improve the methodology of physical performance testing in general.
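The normalization described above can be worked through in a short Python sketch. The exponents are the hypothesized values from the abstract; the function name `normalize`, the dictionary `ALLOMETRIC_B`, the group labels, and the sample numbers are hypothetical and serve only as illustration.

```python
# Hypothesized allometric parameters b from the abstract, keyed by the
# body-size index and the group of tests. Key names are illustrative.
ALLOMETRIC_B = {
    "mass": {
        "external_force": 0.67,
        "rapid_movement": 0.0,
        "supporting_body_weight": -0.33,
    },
    "height": {
        "external_force": 2.0,
        "rapid_movement": 0.0,
        "supporting_body_weight": -1.0,
    },
}


def normalize(performance, size, test_group, size_index="mass"):
    """Return the size-normalized performance P_n = P / S**b."""
    b = ALLOMETRIC_B[size_index][test_group]
    return performance / size ** b


# Made-up example: a force of 800 N for a body height of 2.0 m (b = 2).
print(normalize(800.0, 2.0, "external_force", "height"))

# Rapid movements use b = 0, so the raw score is already size-independent.
print(normalize(50.0, 75.0, "rapid_movement"))
```

A negative exponent, as in the tests of supporting body weight, multiplies the raw score by a positive power of body size, so the normalized score of a larger subject is scaled up instead of down.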
Abstract:
A reduction in executive function (EF) performance is a major factor associated with the loss of functional independence among older adults. Computer-based tests are commonly used to evaluate EF; however, these mouse or keyboard tests are upper limb dominant while most activities of daily living (ADL, e.g. crossing a street) are lower limb dominant. The purpose of this study was to examine the utility of a newly developed walking EF test called the Walking Response and Inhibition Test (WRIT). The WRIT was validated by comparing its results to a number of established computer-based tests and to an ADL-related test known to require EF, the Timed "Up & Go" Test (TUG). Fifty healthy adults, ranging in age from 50 to 86 years (mean ± SD, 65.5 ± 9.6 y), were evaluated using the WRIT, three computer-based EF tests, the TUG, a verbal memory test and an agility test. All computer-based EF tests were positively correlated with the WRIT (p < 0.05); however, regression analyses revealed that the WRIT explained 37.5% of the variance in the TUG, while a composite of traditional computer-based tests explained 10.5%. As indicated by Lin's concordance (pc = 0.82), reliability between testing days was high, and this was supported by a Cronbach's alpha of 0.90. Bland-Altman analyses also demonstrated good agreement between the testing days, with a small mean difference of 3.48 (-3.71, 10.67). These results support the validity and reliability of the WRIT, and indicate that when assessing EF as it relates to functionality, the WRIT may be a more appropriate measure than existing computer-based mouse and keyboard tests. (C) 2017 Elsevier Inc. All rights reserved.